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Abstract 

The  manner  in  which  the  data  base  is  segmented  affects  the  proba- 
bility of  intent  conflict  and  the  time  required  for  retrieving  neces- 
sary fields.   This  paper  examines  a  basic  segmentation  problem  in  which 
two  fields  in  a  data  base  are  required  in  different  manners  by  three 
messages  entering  into  different  terminals.   The  problem  is  interpreted 
as  a  finite  state  Markov  chain  and  formulated  into  two  alternative 
methods  of  segmentation. 


Introduction 

A  "segment*1  may  be  a  field  or  a  group  of  fields  that  can  be  re- 
trieved from  a  data  base  as  a  unit  of  data  by  a  data  base  management  sys- 
tem. With  systems  such  as  IMS  Version  2,  the  concurrent  updating  of 
segments  belonging  to  the  same  type  by  more  than  one  message  creates  an 
intent  conflict  in  which  only  one  message  can  have  access  to  the  segment 
and  the  remaining  messages  have  to  wait  in  queue  for  their  turns.  With 
those  systems,  the  intent  conflict  usually  represents  a  major  cause  of 
deterioration  in  the  response  time  to  a  terminal  message  during  a  peak 
load  period.   Obviously,  the  manner  of  segmenting  a  data  base  will  impor- 
tantly affect  the  probability  of  intent  conflict  as  well  as  the  time 
required  for  retrieving  necessary  fields.   This  study  examines  a  basic 
segmentation  problem  in  which  two  fields  are  required  in  different  manners 
by  three  messages  from  different  terminals.   The  problem  is  interpreted 
as  a  finite  state  Markov  chain  and  formulated  under  two  alternative  meth- 
ods of  segmentation. 

Alternative.  Gases 

Consider  a  situation  in  which  three  terminal  x,  y,  and  z  that  send 
in  messages  concerning  two  fields  a  and  b  in  a  data  base.   Specifically, 
terminal  x's  messages  need  only  field  a,  terminal  y's  messages  only  field 
b,  and  terminal  z's  messages  both  fields  a  and  b.   There  are  two  ways  to 
segment  the  data  base  with  fields  a  and  b,  as  is  illustrated  by  Cases  I 
and  II  in  Figure  1;  in  Case  I,  fields  a  and  b  are  assigned  to  two  differ- 
ent segments  whereas  in  Case  II  both  fields  are  assigned  to  the  same 
segment.   Regardless  of  the  method  of  segmentation,  messages  from  terminal 


z  always  call  for  fields  a  and  b  in  that  order.   It  is  assumed  that  during 
the  peak  load  period  each  terminal  is  saturated  with  messages;  the  opera- 
tor of  a  terminal  can  key  in  a  new  message  as  soon  as  the  processing  of 
the  preceding  message  is  completed;  and  the  keyed-in  message  waits  in  the 
system  at  least  an  epoch  before  being  processed.   In  any  event,  only  one 
message  from  a  terminal  can  be  in  the  system  at  any  time. 

Two  types  of  messages  are  considered;  one  type  is  query  and  the  other 
type  update.  Let  d  (q)  and  d  (u)  represent  the  proportions  of  query  and 
update  messages  sent  by  terminal  g,  g  e  G  =  {x,  y,  z}.   Then  we  have 

(1)  d  (q)  +  d  (u)  =1  g  e  G 

The  processing  of  a  query  message  sent  by  a  terminal  can  be  carried 
out  independent  of  the  current  messages  from  the  other  terminals.   But 
update  messages  from  different  terminals  trying  to  process  the  same  seg- 
ment type  will  cause  intent  conflict.   In  this  case,  priorities  in  message 
processing  are  in  the  order  of  messages  from  terminals  z,  y,  and  x. 
Case  I:   Two  Fields  Assigned  to  Diff erent  Segments 

In  this  case,  terminal  z  must  always  key  in  a  pair  of  messages  to 
process  fields  a  and  b,  in  that  order. 

The  time  required  for  searching  and  processing  a  field  is  simply 
called  the  process  time  and  assumed  as  a  stochastic  variable  with  an  ex- 

—  Qrt 

ponential  distribution  function  given  by  e    where  a  is  the  probability 

of  completing  the  process  given  by  a  reciprocal  of  a  mean  process  time. 

The  mean  process  time  and  the  process  completion  probability  for  terminal 

g,  g  e  G,  are  represented  by  t  (m)  and  a  (m)  for  the  message  type  m  given 

g        g 

as  an  element  of  M,  M  *  {q,  u}  where  q  represent  a  query  and  u  an  update. 


For  terminal  z,  m  is  subscripted  with  h,  an  element  of  set  H  «*  {a,  b},  to 
indicate  the  field  involved.   Thus  we  write: 

a  (m)  =  a  (m  )  =  1/t  (m) 
x       z  a       x 

(2) 

a  (m)  *  az(\)   =  l-/t  (m)  m  e  M 

Since  the  terminal  can  put  in  only  one  message  in  the  system  at  any 
time,  its  status  is  determined  by  the  type  and  current  state  of  its  mes- 
sage in  the  system.   Since  terminals  x  and  y  deal  with  only  one  field, 
their  statuses  are  given  by  elements  of  set  I  »  {q,  q^  u,  u'}  where  q  or 
u  represent  "a  query  or  an  update  in  process",  and  q'  or  u'  "a  query  or 
an  update  in  queue".   The  status  of  terminal  z  is  given  by  an  element  of 
set  K.  =  (qh»  q/>  u,  ,  u,  }  where  h  is  an  element  of  set  H  =  {a,  b} ,  indi- 
cating the  field  currently  involved. 

Let  P  [s  ,  s_]  represent  the  transition  probability  with  which  the 
status  of  terminal  g  changes  from  s  at  an  epoch  to  s«  at  the  next  epoch. 
If  we  let  k,  and  k„  represent  s  and  s^  for  terminal  z>  transition  proba- 
bilities of  terminals  x  and  y  are  given  as  follows: 

Pg[ql  qj  -  1  P  [q,  q]  -  1  -  ag(q) 


(3) 


PJq>  qf]  *  a  (q)d  (q)  P  [q,  u'  ]  -  a  (q)d  (u) 

g                g      S  g  6      & 

P Ju,  u]  -  1  -  a  (u)  P  [u,  q»]  -  a  (u)d  (q) 

8              g  S  g    & 

P  [u,  u'3  -  a  (u)d  (u)  g  e  G±   -'  {x,  y} 

P  [u',  u'  ]  =  1     if  k,  =  k„  =  u  V  k_  =  uT 

x  1    2    a    1    a 


P  [u*  u]  -  1      if     ^(k.  =  k  =  u  )  V  k,  ^  u' 
x  1    2    a     1    a 


py[u;  ur]  =  1       if       ki  *  k2  ~  "b  v  ki  "  "b 

py[u;  u]  =  i        if       m^  -  k2  =  i^)  v  kx  t  u^ 

Otherwise  P  [s  ,  s.]  =0,  g  e  G  ,  for  the  above  and  all  other  combinations 

of  s-  and  s2  (s  ,  s  e  I) . 

For  terminal  z,  transition  probabilities  are  given  by  the  following 
where  i_  and  i  represent  s  and  s-  for  terminal  x,  and  j   and  j„  those 
for  terminal  y: 


pzK>  V  ■ 1  pz[v  \]'x-  W 

w  ^  -  w-  V"b>  ">;)  -  wdz(t» 

<*>      w  u;]  *  az(<ib)d,(u)        w  v =  x  -  w 

Pz[V   "b3  ■  az<°.>  W   <]   =  az(ub)dz(<l) 

'^"b*   Ua'  =  0Lz<-x\?d   (u)  h  £  H  =  {a,   b} 

PzK'   Ua]  =  X  if  h  ~  h  "  % 

Pz[V   ua]  =  X  if  Mil  =  *2  '  ua> 

PzK«   %]  "1  "  h  =  h  =  % 

Otherwise  P   [s..  ,    s0]  =0  for  the  above  and  all  other  combinations  of  s- 

Z   1    Z  L 

and  s~  (s„  s.  E  K) . 

Let  S[i,  j,  k]  be  the  state  of  the  system  in  which  the  statuses  of 
terminals  x,  y,  and  z  are  i,  j,  and  k,  and  X[i,  j,  k]  be  the  probability 


of  the  system  being  in  that  state.   Then  we  have 

HE  X[i,  j,  k]  =  1 

ijk 

(5) 

X[i,  j,  k]  >  0  i,  j  £  I;  k  £  K 

Let  S[i  ,  j- >  k- ]  and  S[i  ,  j0,  k  ]  be  the  states  of  the  system  at 

111  L         £■         £. 

two  consecutive  epochs.      Then,    a  stationary  equilibrium  exists  between  each 
state,    S[i2>    j«,   k„],   and  various   states,    S[i    ,   j    ,   k   ],    leading  to   this 
state.     This   is  fommlated  with  transition  probabilities  in   (3)    and   (4): 

z         l         l     px[ir  i2]  W  h]  Vki>  k2]  «*i-  V  ki] 

(6)  ix  £  I  j1  £  J  kx  e  K 

-  X[i2,  j2,  k2]  =  0       i2,  j2  £  I;  k2  £  K 

Since  we  have  one  equation  for  each  of  the  possible  states  S[i,  j ,  k] , 
a  solution  to  the  set  of  equations  in  (6)  in  terms  of  X[i,  j,  k]  gives  us 
the  values  of  stationary  probabilities  for  the  system  to  be  found  in 
S[i,  j,  k].   From  the  stationary  probabilities  thus  determined,  the  proba- 
bility of  terminal  x,  y,  or  z   being  in  the  status  i,  j,  or  k,  denoted  by 
r  (i) ,  r  (j),  or  r  (k) ,  is  determined  as  follows: 

a.         y  £ 

r  (i)  =   E     E   X[i,  j,  k] 

1  £  I  k  £  K 


(7)       r„(j)  =   E        X[i,  j,  k] 

i,  j  £  I 


y      i  £  I  k  £  K 


r  (k)  =   E     E   X[i,  j,  k] 
z      i£lj£l  k£K 


Using  the  average  processing  rates  in  (2)  and  the  status  probabilities 
in  (7) ,  we  determine  the  expected  response  time  to  message  m  sent  by 


terminal  a  or  b  as  follows: 

(8)  T  (m)  =  {r  (m)  +  r  (m')}  t  (m) /r  (m)     m  e  M 

g       g       g      g     g 

8  £  Gl 

Similarly,  the  expected  response  time  to  a  message  sent  by  terminal 
z  is  given  by  : 

(9)  T  (m)  =  {r  (m  )  +  r  (mf))  t  (m  )/r  (m  ) 

z       z  a     2a    z  a   z  a 

+  (r  (t^)  +  r   (m^)}   tz(mb)/rz(mb)       m  e  M 

Case  II:   Two  Fields  Assigned  to  the  Same  Segment 

In  this  case,  fields  a  and  b  are  assigned  to  the  same  segment,  and 
therefore  one  message  from  terminal  z  can  process  both  fields.   This  en- 
ables us  to  represent  the  status  of  terminal  z  as  well  as  those  of  termi- 
nals x  and  y  by  an  element  of  set  I,  I  =  {q,  q1,  u,  u*  }.  An  intent  conflict 
will  be  created  if  any  two  terminals  send  update  messages  concurrently, 
because  their  fields  now  belong  to  the  same  segment. 

Let  the  average  process  time  and  the  probability  of  process  completion  for 

message  m  sent  by  terminal  g  be  given  by  t  (m)  and  a  (m) ,  then  we  have 

g        g 

(10)  a  (m)  -  1/t  (m)  g  e  G,     ra  e  M 

g        g 

When  there  is  no  intent  conflict,  the  transition  probabilities  of 
all  terminals  in  this  case  are  given  by  the  following  common  forms: 

Pg[q,  q]  =  1  "  ag(q),       P  [q,  qf ]  =  ag(q)dg(q), 
P  [q,  u1]  «  a  (q)d  (u),     P  [u,  u]  =  1  -  a  (u) , 

/^]A  S  8       g  6  6 


P  [u,  q']  =  a  (u)d  (q),  P  [u,  u']  =  a(u)d  (u), 

8  8     8  8  8 

Pg[q\  q!  =  1  g  e  G 

When  there  is  an  intent  conflict,  the  transition  probability  of  a 
terminal  is  given  by  one  of  the  following: 


Px[u',  u?]  =  1   if   (jx  -  uAJ2  =  u)Vj  -  u'V(k  =  uAk2  =  u)Vkx  =  u' 
Px[uf,  u]  =1    if  v<J1  =  uAJ2  -  u)VJ1  ^  u'VM^  ■  uAk2  =  u)Vk  *   u* 


(12) 


P  [uj  u' ]  -  1  if  (i  =  uAi£  =  u)V(k  =  uAk2  =  u)Vk  =  uf 

P  [uj  u]  -  1  if  Mi,  -  nAi  =  u)W(k  =  uAk2  -   u)Vk  4   u? 

Pju1,  u']  =  1  if  tt±  -  uAi2  =  u)V(J1  =  uAJ2  =  u) 

Pz[u',  u]  =  1  if  Mix  =  uAi2  =  u)V(J1  =  uAj2  =  u) 

Otherwise  P  [s_  ,  s?]  =  0,  g  e   G,  for  the  above  and  all  other  combinations 
of  s-  and  s  (s. ,  s  e  I). 

In  the  present  case,  the  stationary  equilibrium  condition  takes 
the  following  form  : 

(13)     E       I      I    Px[i   i  ]  F|J,  j  ]  P  [k^.  k2]  X(il>  J   kx] 
1     Jl      l 

"  XtV  V  k2]  =  °  V  V  k2  £  * 

subject  to 


Z  I  £  X[i,  j,  k]  =  1 
i  j  k 

X[i,  j,  k]  >  0  i,  j,  k  e  I 


8 


Using  a  solution  to  the  set  of  equations  in  (13),  the  expected  re- 
sponse time  to  a  message  from  each  terminal  is  determined  by  the  formulas 
in  (7)  -  (9). 
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